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Introduction 


In  spite  of  the  magnitude  of  the  problem  which  prostate  cancer  presents,  our  understanding  of  the  molecular 
mechanisms  underlying  prostatic  carcinogenesis  remains  elusive.  It  is  clear  from  the  recent  progress  made  in 
colorectal,  renal  and  breast  cancer  that  analysis  of  familial  forms  of  common  human  neoplasms  can  yield  tremendous 
insight  into  the  specific  genetic  mechanisms  in  both  hereditary  and  sporadic  forms  of  such  cancers.  Hereditary  factors 
are  estimated  to  be  responsible  for  about  nine  percent  of  all  cases  of  prostate  cancer  in  the  U.S.  Segregation  analysis 
of  familial  prostate  cancer  has  supported  an  autosomal  dominant  mode  of  inheritance  of  prostate  cancer  susceptibility 
alleles  with  some  evidence  for  heterogeneity.  These  findings  provided  the  basis  for  a  recent  genome  wide  scan  for 
linkage  in  multiplex  prostate  cancer  families.  This  analysis  implicated  a  region  of  chromosome  1  as  being  the  most 
likely  region  of  the  genome  to  contain  a  major  prostate  cancer  susceptibility  gene.  Interestingly,  this  evidence  for 
linkage  was  provided  almost  exclusively  by  large  families  (5  or  more  first  degree  relatives  affected/family)  with  an 
early  average  age  of  diagnosis  (<65  years).  However,  there  was  significant  evidence  for  locus  heterogeneity  and  a 
series  of  other  loci  also  showed  evidence  of  linkage,  albeit  to  a  lesser  extent  than  chromosome  1.  It  is  the  goal  of  the 
research  proposed  herein  to  further  analyze  these  non-chromosome  1  regions  for  additional  evidence  of  linkage  to 
prostate  cancer  susceptibility.  To  detect  these  potential  linkages,  57  additional  families,  each  containing  at  least  five 
affected  members  and  over  half  having  an  average  age  of  diagnosis  under  65,  will  be  collected  for  these  studies,  as 
deemed  necessary  from  simulation  analyses.  Genotypic  data  for  these  families  in  the  regions  of  interest  will  be 
analyzed  using  both  parametric  and  non-parametric  methods,  including  conditional  analyses  and  two  locus  models  to 
test  for  gene-gene  interactions.  These  studies  will  provide  the  basis  for  positional  cloning  efforts  to  identify  and 
characterize  prostate  cancer  susceptibility  genes. 


Body 

Listed  below  is  a  summary  of  the  research  objectives  as  described  in  the  approved  Statement  of  Work  as  it  applies  to 
the  first  12  months  of  the  funding  period,  along  with  the  accomplishments  pertaining  to  these  objectives. 

Task  \)  Ascertain  57  additional  families  with  at  least  5  members  with  prostate  cancer  (months  1-30). 

-  contact  all  family  members  in  the  identified  57  targeted  pedigrees,  obtain  informed  consent,  and  arrange  for 
blood  draw  and  shipment  of  samples  to  Johns  Hopkins;  we  anticipate  canying  out  family  collection 
throughout  the  funding  period,  with  a  collection  rate  of  ~25  families  per  year, 
collect  25  families  (months  1-12) 

Accomplishments  related  to  Task  1: 

Within  the  first  year,  we  ascertained  28  of  the  57  families  proposed  in  our  specific  aims.  We  contacted  each  living 
family  member  to  obtain  informed  consent  and  blood  DNA.  Tables  I  &  II  summarize  the  family  collection  and  blood 
DNA  status  respectively: 


Table  I  Family  Collection 


N 

28 

#  Families 

137 

#  Subjects  contacted 

57 

#  Family  members  deceased 

127  (average  4.5  per  family) 

#  Affected 

67  (average  2.5  per  family 

#  Unaffected 

30 

#  Unaffected  Males 

37 

#  Females 

5 


Table  IIBlood  and  Tissue  Block  Collection 


N 

137  (average  4.9  per  family) 

#  Blood  DNA 

#  Tissue  Blocks 

86  (63%) 

#  Affecteds  with  Blood  DNA 

it  Unaffecteds  with  Blood  DNA 

137 

#  Cell  Lines 

Age  ranges  for  subjects  in  28  families  are  summarized  in  Table  III. 


Table  III _ Age  Descriptions 


N 

36  yrs.  -  80  yrs. 

Age  range  of  subjects 

80 

Oldest  age  of  Affecteds 

36 

Youngest  age  Affecteds 

55 

#  Affecteds  diagnosed  >=65 

64 

#  Affecteds  diagnosed  <65 

Task  2)  Genotype  the  new  and  current  sets  of families  for  highly  polymorphic  markers  in  the  chromosomal  regions  for 
which  we  have  preliminary  evidence  of  linkage,  including  4q26,  5ql2,  7p21,  13q32,  andXq28  (months  1-30). 

-  genotype  existing  pedigrees  at  100  new  loci  (months  1-12) 

-  prepare  DNA  from  25  new  pedigrees  (months  1-12) 

-  genotype  25  new  pedigrees  with  new  markers  (months  10-16) 

Accomplishments  related  to  Task  2: 

Genotypes  have  been  generated  for  over  800  individuals  in  the  existing  102  families  for  the  following  sets  of  markers; 
Xq27-28,  40  markers;  8p,  20  markers;  13q,  25  markers;  lq42-43,  6  markers;  lp36,  6  markers,  for  a  total  of  97  loci. 
DNA  has  been  prepared  from  137  individuals  in  the  28  new  families  and  a  subset  of  these  markers  have  been  analyzed 
in  this  dataset. 

Task  3)  Perform  genetic  linkage  analysis  on  the  existing  102  and  57  new  HPC  families  (months  3-30). 

-  carry  out  parametric  and  non-parametric  two-point  and  multipoint  linkage  analyses  on  new  genotypic  data 
collected  from  existing  families  (months  3-16) 

Accomplishments  related  to  Task  3: 

These  analyses  are  ongoing. 
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Key  Research  Accomplishments 


-  ascertainment  of  28  new  HPC  families,  with  an  average  of  4.5  prostate  cases  per  family 

-  collection  of  blood  samples  form  137  individuals  in  these  families,  and  the  preparation  of  DNA  and 
lymphoblastoid  cell  lines  from  these  individuals 

-  genotyping  of  98  marker  loci  on  our  existing  family  collection  and  a  subset  of  the  newly  ascertained 
families 

two-point  and  multipoint  linkage  analyses  of  these  data  are  underway 
preliminary  heterogeneity  analyses  are  underway 

Reportable  Outcomes 

manuscripts 

o  Xu  et  al.  Nat.  Gen.  20: 175, 1998  Evidence  for  a  Prostate  Cancer  Susceptibility  Locus  on  the  X 
Chromosome. 

o  Xu  et  al.  AJHG  in  press  Combined  Analysis  of  Hereditart  Prostate  Cancer  Linkage  to  lq24-25: 
Results  from  772  Hereditary  Prostate  Cancer  Families  from  the  International  Consortium  for 
Prostate  Cancer  Genetics.  Submitted 


Conclusions 

A  cohort  of  28  hereditary  prostate  cancer  families  containing  127  affected  men  have  been  ascertained,  and  blood 
samples  collected.  These  unique  families  are  highly  informative  for  linkage  analysis.  Genotyping  has  been  carried  out 
on  these  and  our  existing  cohort  of  families,  and  linkage  analysis  of  these  data  are  underway.  These  analyses  will 
greatly  increase  our  ability  to  understand  and  characterize  the  genetic  heterogeneity  of  hereditary  prostate  cancer.  It  is 
critical  to  understand  this  aspect  of  HPC  if  we  are  to  develop  meaningful  genetic  tests  to  identify  individuals  at  high 
risk  of  developing  this  disease. 
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Evidence  for  a  prostate  cancer  susceptibility  locus 
on  the  X  chromosome 

Jianfeng  Deborah  Meyers^  Diha  Freije^,  Sarah  Isaacs^,  Kathy  Wiley^,  Deborah  Nusskern^,  Charles  Ewing^, 
Eric  Wilkens^,  Piroska  Bujnovszky^,  G.  Steven  Bova^’^,  Patrick  Walsh^  &  William  Isaacs^’^  {Group  1) 

Johanna  Schleutker^’',  Mika  Matikainen^^,  Teuvo  Tammela^,  Tapio  Visakorpi^  &  Olli-R  Kallioniemi^^  {Group  2) 
Rebecca  Berry^^  Daniel  Schaid^^  Amy  French^,  Shannon  McDonnell^,  Jennifer  Schroeder^,  Michael  Blute^ 

&  Stephen  Thibodeau^  {Group  3) 

Henrik  Gronberg^’',  Monika  Emanuelsson^,  Jan-Erik  Damber^^,  Anders  Bergh^^ 

&  Bjorn-Anders  Jonsson^^  {Group  4) 

Jeffrey  Smith^^’',  Joan  Bailey-Wilson^^,  John  Carpten^^,  Dietrich  Stephan^^,  Elizabeth  Gillanders^^, 

Isaac  Amundson^^,  Tommi  Kainu^^,  Diana  Freas-Lutz^^,  Agnes  Baffoe-Bonnie^^,  Anne  Van  Aucken^^, 

Raman  Sood^^,  Francis  Collins^^,  Michael  Brownstein^^  &  Jeffrey  Trent^^  {Group  5) 

"^These  authors  contributed  equally  to  this  work. 

Over  200,000  new  prostate  cancer  cases  are  diagnosed  in  the  Genetic  mapping  of  the  locus  represents  an  important  initial 
United  States  each  year,  accounting  for  more  than  35%  of  all  step  In  the  identification  of  an  X-linked  gene  implicated  in  the 
cancer  cases  affecting  men,  and  resulting  in  40,000  deaths  annu-  aetiology  of  HPC. 

allyk  Attempts  to  characterize  genes  predisposing  to  prostate  Despite  the  medical  significance  of  prostate  cancer  in  terms 
cancer  have  been  hampered  by  a  high  phenocopy  rate,  the  late  of  morbidity,  mortality  and  health-care  costs,  our  understand¬ 
age  of  onset  of  the  disease  and,  in  the  absence  of  distinguishing  ing  of  the  molecular  determinants  of  prostate  cancer  suscepti- 
clinical  features,  the  inability  to  stratify  patients  into  subgroups  bility  remains  rudimentary.  Epidemiological  studies  supporting 
relative  to  suspected  genetic  locus  heterogeneity.  We  previously  the  existence  of  hereditary  forms  of  prostate  cancer  have  led  to 
performed  a  genome-wide  search  for  hereditary  prostate  cancer  the  initiation  of  genome-wide  searches  for  loci  contributing  to 
(HPC)  genes,  finding  evidence  of  a  prostate  cancer  susceptibility  hereditary  prostate  cancer.  A  previous  scan  for  linkage  resulted 
locus  on  chromosome  1  (termed  HPC1;  ref.  2).  Here  we  present  in  suggestive  evidence  (lod>1.0)  for  prostate  cancer  susceptibil- 
evidence  for  the  location  of  a  second  prostate  cancer  susceptibil-  ity  loci  on  several  chromosomes,  including  Iq,  4q,  5p,  7p,  13q 
ity  gene,  which  by  heterogeneity  estimates  accounts  for  approx-  and  Xq  (ref.  2).  Statistically  significant  evidence  was  achieved 
imately  16%  of  HPC  cases.  This  HPC  locus  resides  on  the  X  onlyfor  the  locus  lq24-2 5  (HPCi).  Subsequent  stratification  of 
chromosome  (Xq27-"28),  a  finding  consistent  with  results  of  pre-  pedigrees  showed  that  families  linked  to  HPCl  tended  to  have 
vious  population-based  studies  suggesting  an  X-linked  mode  of  an  early  mean  age  of  diagnosis  (under  65  years)  and  a  large 
HPC  inheritance.  Linkage  to  Xq27-28  was  observed  in  a  com-  number  of  affected  members  (>4).  Even  in  this  subset,  this 
bined  study  population  of  360  prostate  cancer  families  collected  locus  accounts  for  only  approximately  one-half  of  the  families^, 
at  four  independent  sites  in  North  America,  Finland  and  Swe-  Further,  although  two  confirmatory  studies  have  corroborated 
den.  A  maximum  two-point  lod  score  of  4.60  was  observed  at  linkage  to  HPCl  (refs  4,5),  three  additional  studies  found  no 
DXS1113,  0=0.26,  in  the  combined  data  set.  Parametric  multi-  clear  evidence  for  HPCi -predisposed  disease  in  their  study 
point  and  non-parametric  analyses  provided  results  consistent  populations^"^.  The  disparity  in  these  studies  emphasizes  the 
with  the  two-point  analysis.  Significant  evidence  for  genetic  common  set  of  obstacles  for  linkage  detection  in  hereditary 
locus  heterogeneity  was  observed,  with  similar  estimates  of  the  prostate  cancer,  most  prominently,  a  high  phenocopy  rate  and 
proportion  of  linked  families  in  each  separate  famliy  collection,  genetic  locus  heterogeneity. 


Table  1  •  Characteristics  of  prostate  cancer  families 


JHU 

Mayo 

Tampere 

Umei 

All 

Number  of  families 

139 

123 

57 

41 

360 

Number  of  individuals  typed 

766 

407 

548 

268 

1989 

Number  of  affected  individuals  typed 

452 

314 

137 

117 

1020 

Avg.  number  of  affected/family  (range) 

5.1  (3-17) 

4.0(3-11) 

3.2  (2-9) 

4.5(3-10) 

4.3  (2-17) 

Avg.  number  of  affected  individuals  typed/family  (range) 

3.2  (2-11) 

2.6  (2-6) 

2.4  (2-9) 

2.8  (2-8) 

2.7  (2-11) 

Avg.  age  at  diagnosis  (range) 

64.1  (39-85) 

67.1  (41-93) 

68.2  (45-90) 

68.0  (46-86) 

66.3  (39-93) 

^Center  for  the  Genetics  of  Asthma  and  Complex  Diseases,  University  of  Maryland,  Baltimore,  Maryland  21201,  USA.  Departments  of  ^Urology,  ^Pathology 
and  ^Oncology,  Johns  Hopkins  Medical  Institutions,  Baltimore,  Maryland  21287,  USA.  ^Laboratory  of  Cancer  Genetics,  Institute  of  Medical  Technology, 
University  of  Tampere  and  Tampere  University  Hospital,  Tampere,  Finland.  Departments  of  ^Laboratory  Medicine  and  Pathology,  ^Health  Sciences  Research 
and^Urology,  Mayo  Clinic/Foundation,  Rochester,  Minnesota  55902,  USA.  Departments  of  ^Oncology,  ^^Urology  &Andrology  and  ^^Pathology,  Umed 
University,  Umed,  Sweden.  ^^Prostate  Cancer  Investigation  Group,  National  Human  Genome  Research  Institute,  National  Institute  of  Health,  Bethesda, 
Maryland  20892,  USA.  ^^Population  Science  Division,  Fox  Chase  Cancer  Center,  Philadelphia,  Pennsylvania  19012,  USA.  Correspondence  should  be 
addressed  to  W.I.  (e-mail:  wisaacs@jhmi.edu). 
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Table  2  •  Two-point  parametric  iod  scores 


lod  (6)® 


Marker 

Heterozygosity 

cMb 

JHU  (139) 

Mayo  (123)' 

Tampere  (57) 

Ume§  (41)^* 

All  (360) 

DXS984 

0.74 

140.0 

0.40  (0.36) 

0.31  (0.34) 

0.87  (0.22) 

0.03  (0.44) 

1.00(0.34) 

DXS1232 

0.66 

140.9 

0.28  (0.36) 

0.00  (0.50) 

0.66  (0.22) 

0.24  (0.40) 

DX51205 

0.66 

142.3 

0.19(0.38) 

0.00  (0.50) 

2.05  (0.14) 

0.33  (0.36) 

DX56751 

0.74 

143.6 

0.49  (0.36) 

0.52  (0.32) 

1.56(0.18) 

1.63  (0.32) 

DX56798 

0.83 

144.8 

0.51  (0.36) 

0.78  (0.22) 

0.87  (0.32) 

DXS8106 

0.70 

146.1 

0.82  (0.34) 

0.80  (0.30) 

0.89(0.16) 

1.93  (0.30) 

DXS6806 

0.81 

147.3 

0.45  (0.36) 

0.78  (0.30) 

0.14(0.28) 

0.03  (0.44) 

1.07(0.34) 

DXS8043 

0.83 

148.8 

0.97  (0.32) 

0.02  (0.40) 

0.00  (0.50) 

0.08  (0.38) 

0.74  (0.36) 

AFMA113zf5 

0.68 

149.3 

0.11  (0.36) 

1.24  (0.28) 

1.22  (0.18) 

2.01  (0.28) 

DXS1200 

0.60 

150.4 

1.98(0.28) 

0.86  (0.26) 

0.17(0.32) 

0.00  (0.50) 

2.80  (0.30) 

DXS297 

0.70 

151.0 

0.64  (0.34) 

0.18(0.36) 

0.13  (0.00) 

0.84  (0.34) 

AFM136yb10 

0.68 

152.5 

1.00(0.30) 

0.40  (0.30) 

0.05  (0.38) 

1.38(0.32) 

DXS8091 

0.80 

152.5 

1.52(0.30) 

0.28  (0.34) 

0.00  (0.50) 

1.43  (0.32) 

DXSl  113 

0.80 

153.0 

1.73  (0.28) 

1.89  (0.26) 

0.49  (0.22) 

0.60  (0.26) 

4.60  (0.26) 

DXSl  193 

0.78 

153.3 

0.96  (0.32) 

0.58  (0.26) 

0.34  (0.32) 

1.80(0.30) 

DXS8069 

0.67 

154.5 

0.44  (0.36) 

0.84  (0.30) 

0.01  (0.40) 

0.12(0.38) 

1.20(0.34) 

DXS8011 

0.87 

154.6 

0.32  (0.36) 

0.58  (0.26) 

0.72  (0.34) 

DXS8103 

0.77 

155.2 

0.10(0.42) 

0.38  (0.34) 

0.92  (0.24) 

0.29  (0.32) 

1.10(0.36) 

AFMA225xh9 

0.74 

156.3 

0.31  (0.36) 

0.98  (0.30) 

0.00  (0.50) 

0.68  (0.36) 

AFMA08xa5 

0.51 

157.1 

0.02  (0.44) 

0.02  (0.40) 

0.09  (0.00) 

0.03  (0.42) 

DXSl  108 

0.70 

158.8 

0.12  (0.42) 

0.57  (0.32) 

0.00  (0.50) 

0.42  (0.38) 

^Maximum  lod  score  under  homogeneity  with  the  maximum  likelihood  estimate  of  the  recombination  fraction  (6),  calculated  using 
FASTLINK.  i^Distance  in  cM  from  Xpter.  ‘^Three  markers  were  not  genotyped  in  this  group.  ‘^Thirteen  markers  were  not  genotyped 
in  this  group. 


A  further  confounding  issue  in  prostate  cancer  linkage  studies 
is  the  lack  of  a  clear  delineation  of  the  mode(s)  of  inheritance. 
Segregation  analyses  of  familial  prostate  cancer  have  supported 
an  autosomal  dominant  mode  of  inheritance  for  prostate  cancer 
susceptibility  alleles^"^^  although  formal  testing  of  possible  X 
chromosome  segregation  has  not  been  performed.  On  the  basis 
of  studies  of  prostate  cancer  risk  in  relatives  of  affected  men,  it 
has  been  suggested  that  an  HPC  susceptibility  locus  may  reside 
on  the  X  chromosome.  Several  population-based  studies  have 
reported  a  statistically  significant  excess  risk  of  prostate  cancer 
in  men  with  affected  brothers,  as  compared  with  those  with 
affected  fathers,  consistent  with  the  hypothesis  of  an  X-linked, 
or  recessive,  model  of  inheritance^ In  our  initial  genome¬ 
wide  search  for  prostate  cancer  linkage,  there  was  suggestive  evi¬ 
dence  of  linkage  to  the  X  chromosome^.  These  indications  have 
prompted  a  more  detailed  analysis  of  potential  X-linkage  in 
HPC  families. 

To  carry  out  this  analysis,  we  have  assembled  360  prostate  can¬ 
cer  pedigrees  consisting  of  families  collected  at  sites  in  the  US 
(Johns  Hopkins  University  (JHU)  in  Baltimore,  Maryland  and 
the  Mayo  Clinic  in  Rochester,  Minnesota),  Finland  (University  of 
Tampere,  Tampere)  and  Sweden  (Umea  University,  Umea). 
Characteristics  of  the  various  family  collections  are  given 
(Table  1).  Overall,  these  360  families  contained  1,568  affected 
members.  DNA  samples,  either  from  blood  or  archival  tissue 
samples,  were  available  from  1,020  affected  individuals,  and  from 
an  additional  969  individuals  who  were  either  female  or  unaf¬ 
fected.  Over  one-half  of  the  families  had  at  least  one  case  of 
apparent  male-to-male  disease  transmission.  As  it  is  possible  that 
some  of  these  occurrences  result  from  a  high  phenocopy  rate,  the 
entire  data  set  was  analysed  for  possible  evidence  of  X  linkage. 

The  results  fi-om  our  previous  10- cM  genome- wide  screen 
using  66  North  American  prostate  cancer  families  implicated  a 
40-cM  interval  from  DXSIOOI  to  DXSllOSy  reaching  a  maximum 
two-point  lod  score  of  1.08  at  marker  DXS1193  at  Xq27-28 
(ref.  2).  To  more  rigorously  test  the  hypothesis  of  linkage  to  this 
region,  an  additional  28  markers  were  selected  to  augment  the  five 
original  survey  markers  across  the  X  chromosome  interval.  These 
markers  were  genotyped  to  create  density  map  intervals  of  1.2-cM 


for  the  139  North  American  HPC  families  collected  at  JHU.  A 
subset  of  26  of  these  markers,  spanning  19  cM  from  DXS984  to 
DXS1108  (140-159  cM  firom  Xpter),  were  genotyped  for  the  123 
Mayo  Clinic  and  the  57  Finnish  HPC  families,  and  a  less  dense, 
4-cM  map  of  eight  markers  in  this  interval  was  completed  for  the 
41  Swedish  families.  Allele  frequencies  were  estimated  from  inde¬ 
pendent  individuals  in  the  complete  data  set.  Two-point  paramet¬ 
ric  lod  scores  are  listed  (Table  2).  Twelve  of  the  markers  tested  had 
lod  scores  greater  than  1  in  the  combined  data  set,  with  a  maxi¬ 
mum  score  of  4.6  at  marker  DXSl  1 13,  0=0.26.  These  results  were 
supported  by  non-parametric  affected  sibpair  analysis  (Table  3). 
Fourteen  consecutive  markers  had  an  excess  mean  identical-by- 
descent  (IBD)  sharing  (0.55),  with  the  lowest  P- value  of  0.00006 
at  DXSl  113.  The  lod  score,  on  the  basis  of  sibpair  IBD  sharing, 


Table  3  •  Two-point  affected  sibpair  analysis 


cM^ 

Mean  IBD*^ 

P-value^ 

iod 

DXS984 

140.0 

0.54 

0.08 

0.42 

DXSl  232 

140.9 

0.51 

0.33 

0.04 

DXSl  205 

142.3 

0.53 

0.15 

0.24 

DXS6751 

143.6 

0.56 

0.005 

1.41 

DXS6798 

144.8 

0.55 

0.047 

0.60 

DXS8106 

146.1 

0.57 

0.005 

1.43 

DXS6806 

147.3 

0.55 

0.039 

0.67 

DXS8043 

148.8 

0.55 

0.023 

0.86 

AFMA113zf5 

149.3 

0.58 

0.013 

1.08 

DXS1200 

150.4 

0.60 

0.00008 

3.11 

DXS297 

151.0 

0.56 

0.025 

0.83 

AFM136yb10 

152.5 

0.57 

0.007 

1.28 

DXS8091 

152.5 

0.57 

0.003 

1.63 

DXSl  113 

153.0 

0.60 

0.00006 

3.20 

DXSl  193 

153.3 

0.57 

0.006 

1.37 

DXS8069 

154.5 

0.55 

0.048 

0.60 

DXS8011 

154.6 

0.55 

0.04 

0.65 

DXS8103 

155.2 

0.52 

0.16 

0.20 

AFMA225xh9 

156.3 

0.54 

0.06 

0.50 

AFMAOSxaS 

157.1 

0.52 

0.32 

0.05 

DXSl  108 

158.8 

0.52 

0.21 

0.14 

^Distance  in  cM  from  Xpter.  ‘^Affected  sibpair  analyses  were  performed  using 
ANALYZE.  ‘^All  possible  sibpairs  were  used  in  the  analysis,  however,  a  weight  of 
(n-1)  was  given  to  the  sibship  of  multiple  sibs,  where  n  is  the  number  of  sibs. 
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cM  from  Xpter 


Fig.  1  Graph  of  multipoint  lod  scores  assuming  heterogenity.  The  peak  multipoint  lod  score  of 
3.85  is  located  between  DXS1200  and  DXS297. 


was  3.2  for  this  marker.  When  population-specific  allele  frequen¬ 
cies  were  used,  similar  results  were  obtained. 

Simulation  studies  were  performed  to  estimate  the  probability 
of  obtaining  a  two-point  parametric  lod  score  of  4.6  or  greater,  or 
a  P- value  less  than  0.00006  for  non-parametric  affected  sibpair 
analysis  (mean  test),  at  a  single  marker  on  the  X  chromosome  in 
the  absence  of  linkage  (false  positive  rate).  Among  10,000  repli¬ 
cates  in  the  simulation,  there  were  no  two-point  parametric  lod 
scores  greater  than  4.0,  nor  were  there  any  P-values  less  than 
0.00006  for  affected  sibpair  analysis.  There  were  three  lod  scores 
greater  than  3,  and  only  once  was  there  a  P- value  less  than  0.0001 
among  the  10,000  replicates. 

Results  from  parametric  multipoint  linkage  analyses  were  con¬ 
sistent  with  the  two-point  analyses.  Data  from  the  Swedish  fami¬ 
lies  were  not  included  in  the  multipoint  analysis,  because  only 
eight  markers  were  genotyped  in  this  dataset.  Analysis  was  car¬ 
ried  out  using  a  sliding  multipoint  approach^ and  hetero¬ 
geneity  analysis  was  then  performed  using  HOMOG  (ref.  20). 
The  maximum  lod  score  assuming  heterogeneity  was  3.85, 
occurring  151  cM  from  Xpter,  between  loci  DXS1200  and 
DXS297  (Fig.  1).  Significant  evidence  for  locus  heterogeneity  was 
obtained,  with  the  proportion  (a)  of  families  linked  estimated  at 
160/0  (x2=17.73,  df=l,  P=0.00002;  Table  4). 

Each  study  population  had  positive  two-point  and  multipoint 
lod  scores  for  multiple  markers  in  the  Xq27-28  region  (Tables 
2,4).  Estimates  of  the  proportions  of  linked  families  in  each  col¬ 
lection  ranged  from  15o/o  (JHU)  to  41o/o  (Tampere),  although  the 
differences  among  groups  are  not  statistically  significant 
(x2=0.53,P=0.77). 

As  a  possible  source  of  genetic  heterogeneity,  we  stratified  fami¬ 
lies  into  two  subsets  on  the  basis  of  consistency  with  an  X-linked 
mode  of  inheritance,  using  the  apparent  presence  or  absence  of 
male-to-male  transmission  as  a  single,  surrogate,  stratification 
criterion.  Following  this  stratification,  129  families  without  male- 
to-male  transmission  contribute  disproportionately  to  the  evi¬ 
dence  of  linkage  to  this  region  (maximum  multipoint  lod  score 
assuming  heterogeneity=2.46  at  151  cM  from  Xpter,  estimated 
proportion  linked=19%).  In  contrast,  for  families  with 'male-to- 
male  transmission  (n=190),  the  maximum  lod  score  assuming 
heterogeneity  was  1,47,  also  at  151  cM,  with  a  lower  estimated  pro¬ 
portion  linked  (13%).  Although  this  difference  is  not  statistically 
significant,  the  observed  trend  is  consistent  with  the  hypothesis  of 
X  chromosome  linkage  in  this  data  set.  The  observation  of  positive 


lod  scores  in  families  with  apparent  male-to-male  dis¬ 
ease  transmission  may  result  from  the  presence  of  phe- 
nocopies  as  affected  fathers  or  other  relatives. 

As  there  was  evidence  for  linkage  of  HPC  suscepti¬ 
bility  loci  to  both  lq24-25  (ref.  2)  and  Xq27-28  in 
families  collected  at  JHU,  we  tested  the  hypothesis 
(Hj)  that  there  are  three  types  of  prostate  cancer  fam¬ 
ilies  in  this  cohort:  (i)  a  proportion  of  the  families 
linked  to  Xq27”28;  (ii)  a  proportion  of  the  remaining 
families  linked  to  lq24-25;  and  (iii)  the  rest  linked  to 
neither  region.  Using  the  admixture  test^® 
(HOMOG3R)  with  multipoint  lod  score  data  for  the 
139  families  in  this  group,  significant  evidence  of 
locus  heterogeneity  was  observed  (Table  5).  The  data 
were  made  at  least  360-fold  more  probable  given  the 
hypothesis  (Hj)  that  subsets  of  HPC  families  are 
linked  to  Xq27-28  or  to  HPCl,  and  the  remainder 
unlinked,  than  the  hypotheses  of  either  as  a  sole  locus 
(H2  or  H3).  Multipoint  data  suggested  that  15%  of  the 
families  in  this  group  were  linked  to  the  X  chromo¬ 
some  locus,  and  that  30%  were  linked  to  HPCL  Simi¬ 
larly,  in  the  59  families  in  this  collection  that  are  not 
linked  to  HPCl  (lod<-0.1),  the  multipoint  lod  score  under  het¬ 
erogeneity  is  1.96  for  Xq27-28,  whereas  the  lod  score  is  0.48  in  the 
remaining  80  families. 

Linkage  analysis  is  valuable  for  identification  of  genetic  loci 
predisposing  to  prostate  cancer.  The  presence  of  genetic  hetero¬ 
geneity  both  in  and  across  populations  necessitates  large-scale 
studies  to  provide  significant  statistical  power  to  identify  major 
loci.  Among  the  JHU  study  population,  loci  at  lq24-25  and 
Xq27-28  are  estimated  to  account  for  approximately  30%  and 
15%  of  the  prostate  cancer  families,  respectively.  In  contrast,  of 
these  two  loci,  only  the  X-chromosome  locus  appears  to  have  a 
prominent  role  in  prostate  cancer  predisposition  in  the  Finnish 
study  population,  in  which  a  larger  fraction  of  families  (over 
40%)  are  estimated  to  be  X-linked,  and  HPCl  shows  only  a  mar¬ 
ginal  role  (J.  Schleutker  etal,  in  preparation).  A  similar  situation 
exists  in  the  Mayo  Clinic  data  set,  although  the  proportion  of 
families  linked  to  the  X  chromosome  is  the  same  as  in  the  JHU 
study  population.  From  these  results,  we  anticipate  that  confir¬ 
matory  studies  will  also  encounter  genetic  heterogeneity.  Indeed, 
a  recently  described  factor  contributing  to  the  lack  of  linkage  to 
HPCl  in  several  family  collections  may  be  the  presence  of  an 
increased  proportion  of  X-linked  pedigrees  in  these  cohorts. 
Similarly,  linkage  to  the  X  chromosome  may  be  most  readily 
apparent  upon  stratification  of  pedigrees  by  male-to-male  dis¬ 
ease  transmission  in  these  populations,  although,  as  we  have 
seen,  evidence  for  this  linkage  is  not  restricted  to  particular  sub¬ 
sets  of  this  stratification.  Further,  as  the  major  proportion  of  the 
families  examined  in  this  study  are  not  linked  to  either  HPCl  or 
the  X-chromosome  locus,  and  as  collection  of  additional  study 


Table  4  •  Heterogeneity  test  using  multipoint  lod  score 
for  each  family  collection 


MLE  estimates  (3-unit  support  interval) 


Group 

lod^ 

a 

map  position*’ 

P-value 

JHU 

2.34 

0.15(0.03-0.30) 

152.5(140.0-154.6) 

0.001 

Mayo 

1.03 

0.16(0.01-0.34) 

154.5(140.0-158.8) 

0.029 

Tampere 

2.03 

0.41  (0.08-0.71) 

143.6  040.0-151.0) 

0.002 

All 

3.85 

0.16(0.06-0.26) 

151.0(140.0-153.3) 

0.00002 

^Heterogeneity  test  was  based  on  sliding  multipoint  lod  scores,  using  the 
admixture  test  (HOMOG),  where  lod  is  calculated  assuming  heterogeneity. 
•^Distance  in  cM  from  Xpter. 
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Table  5  •  Admixture  test  using  multipoint  lod  scores  for  Xq27-28  and 
1q24-25  (139  JHU  families) 


%  linked  to 

%  linked  to 

InL 

P~value 

Hypothesis^ 

1q24-25‘> 

Xq27-28'^ 

a1 

a2 

HI 

0.30 

0.15 

16.43 

H2 

0.29 

[0] 

10.52 

11.82  (2) 

0.0027 

H3 

[0] 

0.16 

5.42 

22.02  (2) 

1.6x10-5 

H4 

[0] 

[0] 

0 

32.86  (5) 

4.0x10-7 

^Heterogeneity  test  was  performed  using  the  admixture  test  (H0M0G3R).  ^Multipoint 
lod  scores  at  1q24-25  were  based  on  markers  D1S158  and  D1S422.  ^Multipoint  lod  score 
at  Xq27-28  were  based  on  markers  AFMA1 13zf5,  DXS1200  and  DXS297.  is  -2ln  likeli¬ 
hood  difference  between  HI  and  each  alternative  hypotheses. 


populations  increases  the  statistical  power,  additional  loci  may  be 
proven  to  account  for  a  portion  of  prostate  cancer  predisposition. 
In  this  regard,  a  recent  study  of  47  French  and  German  families 
had  a  multipoint  lod  score,  assuming  heterogeneity,  of  2.2 
(a=50%)  and  two-point  score  of  2.7  at  lq42.2-43  (ref.  8). 

Significance  of  the  results  achieved  here  is  supported  by  several 
different  lines  of  evidence.  Most  importantly,  the  linkage  data 
derived  from  each  of  four  independent  family  collections  provides 
evidence  of  linkage  to  Xq27-28.  When  combined,  this  data  set 
yields  a  maximum  two-point  lod  score  of  4.6,  meeting  the  pro¬ 
posed  criteria  for  significant  linkage^  k  Second,  non-parametric 
methods  supported  this  result  and  provided  a  model-indepen¬ 
dent  significance  level  of  P=0.00006  for  linkage.  Third,  simula¬ 
tions  performed  to  provide  an  empirical  nominal  significance 
level  for  the  observed  linkage  results  never  yielded  a  two-point  lod 
score  greater  than  4.0,  nor  any  P- value  less  than  0.00006  in  10,000 
replicates.  The  data  support  the  newly  identified  locus  as  predis¬ 
posing  to  hereditary  prostate  cancer  at  Xq27-28. 

A  candidate  prostate  cancer  susceptibility  gene  residing  on  the 
X  chromosome  is  the  androgen  receptor  gene^^”^^  (AP).  AP,  how¬ 
ever,  is  located  at  Xql2,  over  50  cM  from  the  region  implicated  in 
this  study.  Furthermore,  direct  assessment  of  linkage  to  AP  in  the 
North  American  families  studied  here  provides  no  evidence  of 
linkage  (unpublished  observations).  Several  genes  at  Xq27-28 
have  been  mapped  (http://www.ncbi.nlm.nih.gov/genemap),  and 
these  and  other  novel  genes  in  the  Xq27“28  region  will  need  to  be 
evaluated  as  candidate  prostate  cancer  susceptibility  genes. 

We  have  presented  evidence  for  linkage  of  a  significant  subset  of 
prostate  cancer  families  to  a  locus  on  Xq27-28.  Contingent  upon 
confirmation,  we  suggest  the  designation  HPCX  for  this  locus. 

Methods 

North  American  families.  Johns  Hopkins  family  collection:  The  79  North 
American  families  that  were  described  in  the  report  of  linkage  to  HPCl 
(ref.  2)  are  included  in  this  study,  as  are  an  additional  60  pedigrees  collect¬ 
ed  at  the  Brady  Urologic  Institute  at  Johns  Hopkins.  A  majority  of  these 
families  were  ascertained  through  referrals  from  physicians;  some  families 
were  recruited  from  earlier  epidemiological  studies^  and  through  news 
articles.  Age  of  diagnosis  of  prostate  cancer  was  confirmed  either  through 
medical  records  or  from  two  other  independent  sources.  All  individuals  in 
this  study  gave  full  informed  consent. 

Mayo  Clinic  family  collection:  The  123  North  American  families  in  this 
collection  were  ascertained  by  a  cancer  family-history  survey,  sent  to  over 
5,000  men  who  underwent  a  radical  prostatectomy  for  clinically  localized 
prostate  cancer  in  the  Department  of  Urology  at  the  Mayo  Clinic  during 
1966-1995  (ref.  11).  Prostate  cancer  diagnosis  and  the  age  of  onset  was 
confirmed  through  medical  records  at  the  Mayo  Clinic  and  elsewhere.  All 
participants  in  this  study  gave  full  informed  consent. 

Finnish  families.  In  Finland,  302  prostate  cancer  families  with  two  or  more 
affected  cases  were  identified  through  referrals  from  physicians,  family 
questionnaires  sent  to  patients,  a  nationwide  registry-based  search  and 


advertisements  in  newspapers,  radio  and  television.  Of  this 
group,  57  families  that  were  informative  for  linkage  analyses  were 
included  in  this  study.  Diagnosis  of  all  prostate  cancer  patients 
was  confirmed  through  hospital  records  or  from  the  Finnish  can¬ 
cer  registry.  All  individuals  participating  in  this  study  gave  full 
informed  consent.  j 

Swedish  families.  Since  1995,  families  with  three  or  more  rela¬ 
tives  affected  with  prostate  cancer  have  been  collected  at  the 
Department  of  Oncology  of  Umea  University,  mainly  from 
referrals  from  urologists  throughout  Sweden.  From  approxi¬ 
mately  300  referrals,  41  families  informative  for  linkage  analysis 
have  been  selected.  Twelve  of  these  families  were  included  in  an 
earlier  report^.  When  blood  samples  were  unavailable,  tissue 
samples  were  collected  from  affected  men  whenever  possible. 

Tissue  samples  were  reviewed  by  an  experienced  pathologist  and 
microdissection  was  performed  to  separate  normal  and  tumour  tissue. 

For  genotyping,  only  normal  tissue  was  used.  All  prostate  cancer  diag-  , 

noses  in  the  families  were  confirmed  by  the  National  cancer  registry  and 
medical  records. 

Genotyping  methods.  Techniques  of  preparing  DNA  and  genotyping  were  , 

as  described^.  Markers  were  derived  from  the  Genome  Database  (Johns 
Hopkins  University  School  of  Medicine).  Marker  data  was  obtained  for  33 
polymorphic  loci  available  in  the  GDB,  spanning  the  approximately  40-cM 
interval  between  DXSIOOI  and  DXS1108.  Order  and  distance  for  these 
markers  was  estimated  from  the  entire  genotype  data  set  using  CRIMAP 
(ref.  26).  The  most  likely  order  thus  determined  agrees  with  the  published 
order^^.  Allele  frequencies  were  estimated  from  genotypes  of  independent 
individuals  in  the  360  families. 

Statistical  methods.  Both  parametric  and  non-parametric  linkage  I 

approaches  were  used  in  this  study.  The  parametric  analysis  used  a  previous  j 

modeP'^  with  regard  to  disease  allele  frequency  (0.003)  and  age-specific  j 

penetrances,  although  affected  men  were  assumed  to  be  carriers  of  an  | 

X-linked,  sex-limited,  dominant  gene.  A  fixed  15%  phenocopy  rate,  that  is, 

P  (non-predisposing  genotype/disease),  was  assumed,  whereas  ail  unaffect-  i 

ed  men  under  75,  and  all  women,  were  assumed  to  be  of  unknown  pheno-  j 

type.  In  men  over  age  75,  the  lifetime  penetrance  of  gene  carriers  was  esti-  , 

mated  to  be  63%,  and  the  lifetime  risk  of  prostate  cancer  for  a  non-carrier  j 

was  1 6%  in  this  age  class.  FASTLINK  (refs  18,19)  and  ANALYZE  (ftp://link-  j 

age.cpmc.columbia.edu/software/analyze)  were  used  for  the  parametric  | 

two-point  analysis.  For  the  non-parametric  analysis,  affected  sibpairs  were 
used  for  the  two-point  analysis  as  implemented  by  ANALYZE,  using  the  ■ 

mean  test  and  likelihood  based  test.  The  mean  test  compares  the  number  of  ! 

alleles  shared  IBD  with  the  number  of  alleles  not  shared  IBD  among  affected  i 

sibpairs.  When  there  are  multiple  sibs  in  a  sibship,  a  weight  of  (n-1 )  is  given  j 

to  the  sibship,  where  n  is  the  number  of  sibs.  When  parents  are  not  geno-  | 

typed,  the  program  computes  the  likelihood  of  each  possible  genotype  for  | 

the  parents,  and  computes  the  number  of  alleles  shared  IBD  in  a  sibpair  as  I 

the  average  over  all  possible  parental  genotype  combinations,  weighted  by  I 

their  conditional  probabilities  given  the  known  data.  | 

The  simulation  study  was  performed  using  FASTSLINK  (ftp://watson.  J 

hgen.pitt.edu/pub).  A  10-allele  marker,  which  represents  the  marker  j 

DXSl  113,  was  simulated  unlinked  to  the  disease  locus  using  the  exact  pedi-  j 

gree  structure  and  availability  of  genotype  information  for  the  360  families  j 

analysed.  The  marker  DXSl  1 13  has  1 5  alleles,  six  of  which  have  frequencies  , 

of  approximately  1%  or  less.  To  make  the  simulation  of  a  large  number  of 
replicates  (10,000)  more  practical,  we  collapsed  the  six  less  frequent  alleles 
into  one  allele. 

The  multipoint  approach  is  critical  in  linkage  analysis  of  a  late  age-of-  ! 

onset  disease  such  as  prostate  cancer,  because  parental  genotypic  data  are 
often  missing,  making  inference  of  IBD  ambiguous.  Additionally,  multi¬ 
point  analysis  is  more  robust  to  misspecification  of  allele  frequencies  and 
statistical  fluctuations  at  individual  loci.  When  more  markers  are  used 
simultaneously  in  the  analysis  (multipoint  analysis),  the  probability  distri¬ 
bution  is  concentrated  on  certain  inheritance  vectors,  thus  the  determina¬ 
tion  of  IBD  is  less  dependent  on  the  marker  allele  frequencies^®.  However, 
multipoint  analyses  of  X-chromosome  marker  data  are  hampered  by  the 
lack  of  fully  functional  X-chromosome  versions  of  the  most  appropriate 
multipoint  analysis  computer  programs  (for  example,  GENEHUNTER). 
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In  this  study,  the  parametric  multipoint  analysis  was  performed  using 
FASTLINK  (LINKMAP;  refs  18,19).  Due  to  computer  memory  con¬ 
straint,  only  4-point  analyses  (disease  locus  against  three  marker  loci) 
were  performed.  A  sliding  multipoint  approach  was  used  as  described 
Briefly,  this  approach  consists  of  sliding  a  group  of  three  loci  down  the 
map  and  analysing  the  disease  locus  only  in  the  interval  between  the  sec¬ 
ond  and  third  marker.  Heterogeneity  analysis  was  then  performed  using 
HOMOG  (ref.  20). 

The  admixture  model  was  used  to  test  several  hypotheses  for  genetic 
locus  heterogeneity  (HOMOG3R;  ref.  20).  is  the  proportion  of  families 
linked  to  the  first  disease  locus  (that  is,  lq24-25),  and  a2  is  the  proportion 
linked  to  the  second  disease  locus  (that  is,  Xq27~28).  Hypothesis  1  (Hj) 
assumes  that  there  are  three  types  of  families  in  the  sample,  (a^  a2  and  1- 
(ai+a2)).  Hypothesis  2  (H2)  assumes  that  there  are  two  types  of  families, 
aj  and  l-ttj.  Hypothesis  3  (H3)  assumes  that  there  are  two  types  of  fami¬ 
lies,  a2  and  l-a2  Hypothesis  4  (H4)  assumes  no  linkage  to  either  disease 
locus  (aj=(X2=0)*  Maximum  likelihood  for  each  of  these  hypotheses  was 
calculated  from  the  data.  Chi-square  (x^)  tests  were  performed  by  calculat¬ 
ing  twice  the  difference  of  the  natural  log  likelihood  between  two  hypothe¬ 
ses,  with  the  degrees  of  freedom  (df)  equal  to  the  difference  in  the  number 
of  parameters  estimated  for  the  two  hypotheses.  The  asymptotic  null  dis¬ 
tribution  of  the  test  statistic  has  not  been  well  investigated,  but  this 
approach  is  conservative^®. 


Stratification  of  families.  The  criteria  used  to  categorize  a  family  as  having 
evidence  of  male-to-male  disease  transmission  were  as  follows:  (i)  presence 
of  affected  father  and  affected  son(s)  combinations,  or  (ii)  prostate  cancer 
case(s)  on  the  paternal  side  of  the  family,  with  no  evidence  of  affected  rela¬ 
tives  on  maternal  side.  Families  that  did  not  meet  these  criteria  were  classi¬ 
fied  as  families  without  evidence  of  male-to-male  transmission. 
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